Gamma Boltzmann Machine for Audio Modeling
نویسندگان
چکیده
This paper presents an energy-based probabilistic model that handles nonnegative data in consideration of both linear and logarithmic scales. In audio applications, magnitude time-frequency representation, including spectrogram, is regarded as one the most important features. Such magnitude-based features have been extensively utilized learning-based processing. Since a scale terms auditory perception, are usually computed with function. That is, function applied within computation so learning machine does not to explicitly scale. We think different way propose restricted Boltzmann (RBM) simultaneously models linear- log-magnitude spectra. RBM stochastic neural network can discover representations without supervision. To manage scales, we define energy based on results conditional distribution (of observable data, given hidden units) written gamma distribution, hence proposed termed gamma-Bernoulli RBM. The was compared ordinary Gaussian-Bernoulli by speech representation experiments. Both objective subjective evaluations illustrated advantage model.
منابع مشابه
Gated Boltzmann Machine in Texture Modeling
In this paper, we consider the problem of modeling complex texture information using undirected probabilistic graphical models. Texture is a special type of data that one can better understand by considering its local structure. For that purpose, we propose a convolutional variant of the Gaussian gated Boltzmann machine (GGBM) [12], inspired by the co-occurrence matrix in traditional texture an...
متن کاملModeling Documents with a Deep Boltzmann Machine
We introduce a type of Deep Boltzmann Machine (DBM) that is suitable for extracting distributed semantic representations from a large unstructured collection of documents. We overcome the apparent difficulty of training a DBM with judicious parameter tying. This enables an efficient pretraining algorithm and a state initialization scheme for fast inference. The model can be trained just as effi...
متن کاملA Deep Incremental Boltzmann Machine for Modeling Context in Robots
Context is an essential capability for robots that are to be as adaptive as possible in challenging environments. Although there are many context modeling efforts, they assume a fixed structure and number of contexts. In this paper, we propose an incremental deep model that extends Restricted Boltzmann Machines. Our model gets one scene at a time, and gradually extends the contextual model when...
متن کاملTemporally Adaptive Restricted Boltzmann Machine for Background Modeling
We examine the fundamental problem of background modeling which is to model the background scenes in video sequences and segment the moving objects from the background. A novel approach is proposed based on the Restricted Boltzmann Machine (RBM) while exploiting the temporal nature of the problem. In particular, we augment the standard RBM to take a window of sequential video frames as input an...
متن کاملModeling pigeon behavior using a Conditional Restricted Boltzmann Machine
In an effort to better understand the complex courtship behaviour of pigeons, we have built a model learned from motion capture data. We employ a Conditional Restricted Boltzmann Machine (CRBM) with binary latent features and real-valued visible units. The units are conditioned on information from previous time steps to capture dynamics. We validate a trained model by quantifying the characteri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/ACM transactions on audio, speech, and language processing
سال: 2021
ISSN: ['2329-9304', '2329-9290']
DOI: https://doi.org/10.1109/taslp.2021.3095656